SEQUENCE LISTING 



GENERAL INFORMATION: 

(i) APPLICANT: Koji YANAI et al. 

(ii) TITLE OF INVENTION: β-FRUCTOFURANOSIDASE AND ITS GENE,
METHOD OF ISOLATING β-FRUCTOFURANOSIDASE GENE, SYSTEM
FOR PRODUCING β-FRUCTOFURANOSIDASE, AND
β-FRUCTOFURANOSIDASE VARIANT

(iii) NUMBER OF SEQUENCES: 35 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Wenderoth, Lind & Ponack, L.L.P. 

(B) STREET: 2033 K Street, N.W., Suite 800 

(C) CITY: Washington 

(D) STATE: D.C. 

(E) COUNTRY: U.S.A. 

(F) ZIP: 20006 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 mb 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: MS-DOS 

(D) SOFTWARE: WordPerfect 5.1 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: NEW 

(B) FILING DATE: November 23, 2001 

(vii) PRIOR APPLICATION DATA: 



(A) APPLICATION NUMBER: 09/142,623 

(B) FILING DATE: September 10, 1998 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Lee Cheng 

(B) REGISTRATION NUMBER: 40,949 

(C) REFERENCE/DOCKET NUMBER: 2001-1611

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-721-8200 

(B) TELEFAX: 202-721-8250 
(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 635 amino acid residues

(B) TYPE: amino acid 

(C) STRANDEDNESS : Not relevant 

(D) TOPOLOGY: Not relevant 

(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1 

(ATCC 20611) 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..635 

(C) IDENTIFICATION METHOD: E 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
Ser Tyr His Leu Asp Thr Thr Ala Pro Pro Pro Thr Asn Leu Ser Thr
1 5 10 15

Leu Pro Asn Asn Thr Leu Phe His Val Trp Arg Pro Arg Ala His Ile
20 25 30

Leu Pro Ala Glu Gly Gln Ile Gly Asp Pro Cys Ala His Tyr Thr Asp
35 40 45

Pro Ser Thr Gly Leu Phe His Val Gly Phe Leu His Asp Gly Asp Gly
50 55 60

Ile Ala Gly Ala Thr Thr Ala Asn Leu Ala Thr Tyr Thr Asp Thr Ser
65 70 75 80

Asp Asn Gly Ser Phe Leu Ile Gln Pro Gly Gly Lys Asn Asp Pro Val
85 90 95

Ala Val Phe Asp Gly Ala Val Ile Pro Val Gly Val Asn Asn Thr Pro
100 105 110

Thr Leu Leu Tyr Thr Ser Val Ser Phe Leu Pro Ile His Trp Ser Ile
115 120 125

Pro Tyr Thr Arg Gly Ser Glu Thr Gln Ser Leu Ala Val Ala Arg Asp
130 135 140

Gly Gly Arg Arg Phe Asp Lys Leu Asp Gln Gly Pro Val Ile Ala Asp
145 150 155 160

His Pro Phe Ala Val Asp Val Thr Ala Phe Arg Asp Pro Phe Val Phe
165 170 175

Arg Ser Ala Lys Leu Asp Val Leu Leu Ser Leu Asp Glu Glu Val Ala
180 185 190

Arg Asn Glu Thr Ala Val Gln Gln Ala Val Asp Gly Trp Thr Glu Lys
195 200 205

Asn Ala Pro Trp Tyr Val Ala Val Ser Gly Gly Val His Gly Val Gly
210 215 220

Pro Ala Gln Phe Leu Tyr Arg Gln Asn Gly Gly Asn Ala Ser Glu Phe
225 230 235 240

Gln Tyr Trp Glu Tyr Leu Gly Glu Trp Trp Gln Glu Ala Thr Asn Ser
245 250 255

Ser Trp Gly Asp Glu Gly Thr Trp Ala Gly Arg Trp Gly Phe Asn Phe
260 265 270

Glu Thr Gly Asn Val Leu Phe Leu Thr Glu Glu Gly His Asp Pro Gln
275 280 285

Thr Gly Glu Val Phe Val Thr Leu Gly Thr Glu Gly Ser Gly Leu Pro
290 295 300

Ile Val Pro Gln Val Ser Ser Ile His Asp Met Leu Trp Ala Ala Gly
305 310 315 320

Glu Val Gly Val Gly Ser Glu Gln Glu Gly Ala Lys Val Glu Phe Ser
325 330 335

Pro Ser Met Ala Gly Phe Leu Asp Trp Gly Phe Ser Ala Tyr Ala Ala
340 345 350

Ala Gly Lys Val Leu Pro Ala Ser Ser Ala Val Ser Lys Thr Ser Gly
355 360 365

Val Glu Val Asp Arg Tyr Val Ser Phe Val Trp Leu Thr Gly Asp Gln
370 375 380

Tyr Glu Gln Ala Asp Gly Phe Pro Thr Ala Gln Gln Gly Trp Thr Gly
385 390 395 400

Ser Leu Leu Leu Pro Arg Glu Leu Lys Val Gln Thr Val Glu Asn Val
405 410 415

Val Asp Asn Glu Leu Val Arg Glu Glu Gly Val Ser Trp Val Val Gly
420 425 430

Glu Ser Asp Asn Gln Thr Ala Arg Leu Arg Thr Leu Gly Ile Thr Ile
435 440 445

Ala Arg Glu Thr Lys Ala Ala Leu Leu Ala Asn Gly Ser Val Thr Ala
450 455 460

Glu Glu Asp Arg Thr Leu Gln Thr Ala Ala Val Val Pro Phe Ala Gln
465 470 475 480

Ser Pro Ser Ser Lys Phe Phe Val Leu Thr Ala Gln Leu Glu Phe Pro
485 490 495

Ala Ser Ala Arg Ser Ser Pro Leu Gln Ser Gly Phe Glu Ile Leu Ala
500 505 510

Ser Glu Leu Glu Arg Thr Ala Ile Tyr Tyr Gln Phe Ser Asn Glu Ser
515 520 525

Leu Val Val Asp Arg Ser Gln Thr Ser Ala Ala Ala Pro Thr Asn Pro
530 535 540

Gly Leu Asp Ser Phe Thr Glu Ser Gly Lys Leu Arg Leu Phe Asp Val
545 550 555 560

Ile Glu Asn Gly Gln Glu Gln Val Glu Thr Leu Asp Leu Thr Val Val
565 570 575

Val Asp Asn Ala Val Val Glu Val Tyr Ala Asn Gly Arg Phe Ala Leu
580 585 590

Ser Thr Trp Ala Arg Ser Trp Tyr Asp Asn Ser Thr Gln Ile Arg Phe
595 600 605

Phe His Asn Gly Glu Gly Glu Val Gln Phe Arg Asn Val Ser Val Ser
610 615 620

Glu Gly Leu Tyr Asn Ala Trp Pro Glu Arg Asn
625 630 635



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1905 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double stranded 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1

(ATCC 20611)

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..1905

(C) IDENTIFICATION METHOD: E 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



TCATACCACC TGGACACCAC GGCCCCGCCG CCGACCAACC TCAGCACCCT CCCCAACAAC 60
ACCCTCTTCC ACGTGTGGCG GCCGCGCGCG CACATCCTGC CCGCCGAGGG CCAGATCGGC 120
GACCCCTGCG CGCACTACAC CGACCCATCC ACCGGCCTCT TCCACGTGGG GTTCCTGCAC 180
GACGGGGACG GCATCGCGGG CGCCACCACG GCCAACCTGG CCACCTACAC CGATACCTCC 240
GATAACGGGA GCTTCCTGAT CCAGCCGGGC GGGAAGAACG ACCCCGTCGC CGTGTTCGAC 300
GGCGCCGTCA TCCCCGTCGG CGTCAACAAC ACCCCCACCT TACTCTACAC CTCCGTCTCC 360
TTCCTGCCCA TCCACTGGTC CATCCCCTAC ACCCGCGGCA GCGAGACGCA GTCGTTGGCC 420
GTCGCGCGCG ACGGCGGCCG CCGCTTCGAC AAGCTCGACC AGGGCCCCGT CATCGCCGAC 480
CACCCCTTCG CCGTCGACGT CACCGCCTTC CGCGATCCGT TTGTCTTCCG CAGTGCCAAG 540
TTGGATGTGC TGCTGTCGTT GGATGAGGAG GTGGCGCGGA ATGAGACGGC CGTGCAGCAG 600
GCCGTCGATG GCTGGACCGA GAAGAACGCC CCCTGGTATG TCGCGGTCTC TGGCGGGGTG 660
CACGGCGTCG GGCCCGCGCA GTTCCTCTAC CGCCAGAACG GCGGGAACGC TTCCGAGTTC 720
CAGTACTGGG AGTACCTCGG GGAGTGGTGG CAGGAGGCGA CCAACTCCAG CTGGGGCGAC 780
GAGGGCACCT GGGCCGGGCG CTGGGGGTTC AACTTCGAGA CGGGGAATGT GCTCTTCCTC 840
ACCGAGGAGG GCCATGACCC CCAGACGGGC GAGGTGTTCG TCACCCTCGG CACGGAGGGG 900
TCTGGCCTGC CAATCGTGCC GCAGGTCTCC AGTATCCACG ATATGCTGTG GGCGGCGGGT 960
GAGGTCGGGG TGGGCAGTGA GCAGGAGGGT GCCAAGGTCG AGTTCTCCCC CTCCATGGCC 1020
GGGTTTCTGG ACTGGGGGTT CAGCGCCTAC GCTGCGGCGG GCAAGGTGCT GCCGGCCAGC 1080
TCGGCGGTGT CGAAGACCAG CGGCGTGGAG GTGGATCGGT ATGTCTCGTT CGTCTGGTTG 1140
ACGGGCGACC AGTACGAGCA GGCGGACGGG TTCCCCACGG CCCAGCAGGG GTGGACGGGG 1200
TCGCTGCTGC TGCCGCGCGA GCTGAAGGTG CAGACGGTGG AGAACGTCGT CGACAACGAG 1260
CTGGTGCGCG AGGAGGGCGT GTCGTGGGTG GTGGGGGAGT CGGACAACCA GACGGCCAGG 1320
CTGCGCACGC TGGGGATCAC GATCGCCCGG GAGACCAAGG CGGCCCTGCT GGCCAACGGC 1380
TCGGTGACCG CGGAGGAGGA CCGCACGCTG CAGACGGCGG CCGTCGTGCC GTTCGCGCAA 1440
TCGCCGAGCT CCAAGTTCTT CGTGCTGACG GCCCAGCTGG AGTTCCCCGC GAGCGCGCGC 1500

TCGTCCCCGC TCCAGTCCGG GTTCGAAATC CTGGCGTCGG AGCTGGAGCG CACGGCCATC 1560
TACTACCAGT TCAGCAACGA GTCGCTGGTC GTCGACCGCA GCCAGACTAG TGCGGCGGCG 1620
CCCACGAACC CCGGGCTGGA TAGCTTTACT GAGTCCGGCA AGTTGCGGTT GTTCGACGTG 1680
ATCGAGAACG GCCAGGAGCA GGTCGAGACG TTGGATCTCA CTGTCGTCGT GGATAACGCG 1740
GTTGTCGAGG TGTATGCCAA CGGGCGCTTT GCGTTGAGCA CCTGGGCGAG ATCGTGGTAC 1800
GACAACTCCA CCCAGATCCG CTTCTTCCAC AACGGCGAGG GCGAGGTGCA GTTCAGGAAT 1860
GTCTCCGTGT CGGAGGGGCT CTATAACGCC TGGCCGGAGA GAAAT 1905
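
The nucleotide sequence of SEQ ID NO: 2 and the amino acid sequence of SEQ ID NO: 1 both describe the same mature peptide, so one can be checked against the other. The following is a minimal Python sketch of such a check, assuming the standard genetic code and reading frame 1; the names `CODON_TABLE`, `THREE_LETTER`, and `translate` are illustrative and are not part of the listing.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# One-letter to three-letter residue names, as used in the listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna):
    """Translate DNA (spaces allowed) in frame 1 into three-letter residues."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return [THREE_LETTER[CODON_TABLE[dna[i:i + 3]]] for i in range(0, usable, 3)]


# First row (positions 1-60) of SEQ ID NO: 2.
first_row = "TCATACCACC TGGACACCAC GGCCCCGCCG CCGACCAACC TCAGCACCCT CCCCAACAAC"
print(" ".join(translate(first_row)))
```

Translating the first 60 bases in this way yields the first 20 residues listed for SEQ ID NO: 1 (Ser Tyr His Leu Asp ... Leu Pro Asn Asn); the same function can be applied row by row to the rest of the sequence.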



(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acid residues

(B) TYPE: Amino acid

(C) STRANDEDNESS: Not relevant

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Peptide

(v) FRAGMENT TYPE: internal fragment

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1

(ATCC 20611)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
Leu Asp Gln Gly Pro Val Ile Ala Asp His Pro Phe Ala Val Asp Val
1 5 10 15

Thr Ala Phe Arg
20



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acid residues

(B) TYPE: Amino acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Peptide 

(v) FRAGMENT TYPE: internal fragment 
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1

(ATCC 20611)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
Val Glu Phe Ser Pro Ser Met Ala Gly Phe Leu Asp Trp Gly Phe Ser
1 5 10 15

Ala Tyr Ala Ala
20

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acid residues 

(B) TYPE: Amino acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Peptide 

(v) FRAGMENT TYPE: internal fragment



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1 

(ATCC 20611) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
Val Gln Thr Val Glu Asn Val Val Asp Asn Glu Leu Val Arg Glu Glu
1 5 10 15
Gly Val Ser Trp 
20 

(2) INFORMATION FOR SEQ ID NO: 6:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 amino acid residues

(B) TYPE: Amino acid

(C) STRANDEDNESS: Not relevant

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Peptide

(v) FRAGMENT TYPE: internal fragment

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1

(ATCC 20611)
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
Ala Ala Leu Leu Ala Xaa Gly Ser Val Thr Ala Glu Glu Asp Arg Thr
1 5 10 15

Leu Gln Thr Ala
20

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acid residues 

(B) TYPE: Amino acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Peptide 

(v) FRAGMENT TYPE: N-terminal fragment 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1 

(ATCC 20611) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
Ser Tyr His Leu Asp Thr
1 5



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

ATCGCSGAYC AYCCSTTYGC 20

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TCRTTRTCSA CSACRTTYTC 20
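
SEQ ID NOs: 8 and 9 contain the letters S, Y and R, which under the IUPAC nucleotide ambiguity codes each stand for two possible bases (S = C or G, Y = C or T, R = A or G), so each listed primer represents a pool of concrete oligonucleotides. The following minimal Python sketch, offered only as an illustration, enumerates that pool and checks it against the two ends of SEQ ID NO: 10; the helper names `expand_degenerate` and `reverse_complement` are assumptions introduced here and do not come from the listing.

```python
from itertools import product

# IUPAC nucleotide ambiguity codes mapped to the bases they may represent.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def expand_degenerate(primer):
    """Return every concrete sequence encoded by a degenerate primer."""
    primer = "".join(primer.split()).upper()
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in primer))]


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


forward = expand_degenerate("ATCGCSGAYC AYCCSTTYGC")   # SEQ ID NO: 8
reverse = expand_degenerate("TCRTTRTCSA CSACRTTYTC")   # SEQ ID NO: 9
print(len(forward), len(reverse))

# First and last 20 bases of SEQ ID NO: 10, written without spaces.
start_of_seq10 = "ATCGCCGACCACCCCTTCGC"
end_of_seq10 = "GAGAACGTCGTCGACAACGA"
print(start_of_seq10 in forward)
print(reverse_complement(end_of_seq10) in reverse)
```

Under these assumptions, SEQ ID NO: 8 has five two-fold positions and SEQ ID NO: 9 has six, and the first and last 20 bases of SEQ ID NO: 10 each correspond to one member of the respective pool (the latter as its reverse complement).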

(2) INFORMATION FOR SEQ ID NO: 10:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 788 base pairs

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Double stranded

(D) TOPOLOGY: Linear

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Aspergillus niger ACE-2-1 

(ATCC 20611) 

(ix) FEATURE: 

(A) NAME/KEY: P CDS (partial amino acid sequence) 

(B) LOCATION: 1 .. 788 

(C) IDENTIFICATION METHOD: E 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

ATC GCC GAC CAC CCC TTC GCC GTC GAC GTC ACC GCC TTC CGC GAT CCG 48
Ile Ala Asp His Pro Phe Ala Val Asp Val Thr Ala Phe Arg Asp Pro
1 5 10 15

TTT GTC TTC CGC AGT GCC AAG TTG GAT GTG CTG CTG TCG TTG GAT GAG 96
Phe Val Phe Arg Ser Ala Lys Leu Asp Val Leu Leu Ser Leu Asp Glu
20 25 30

GAG GTG GCG CGG AAT GAG ACG GCC GTG CAG CAG GCC GTC GAT GGC TGG 144
Glu Val Ala Arg Asn Glu Thr Ala Val Gln Gln Ala Val Asp Gly Trp
35 40 45

ACC GAG AAG AAC GCC CCC TGG TAT GTC GCG GTC TCT GGC GGG GTG CAC 192
Thr Glu Lys Asn Ala Pro Trp Tyr Val Ala Val Ser Gly Gly Val His
50 55 60

GGC GTC GGG CCC GCG CAG TTC CTC TAC CGC CAG AAC GGC GGG AAC GCT 240
Gly Val Gly Pro Ala Gln Phe Leu Tyr Arg Gln Asn Gly Gly Asn Ala
65 70 75 80

TCC GAG TTC CAG TAC TGG GAG TAC CTC GGG GAG TGG TGG CAG GAG GCG 288
Ser Glu Phe Gln Tyr Trp Glu Tyr Leu Gly Glu Trp Trp Gln Glu Ala
85 90 95

ACC AAC TCC AGC TGG GGC GAC GAG GGC ACC TGG GCC GGG CGC TGG GGG 336
Thr Asn Ser Ser Trp Gly Asp Glu Gly Thr Trp Ala Gly Arg Trp Gly
100 105 110

TTC AAC TTC GAG ACG GGG AAT GTG CTC TTC CTC ACC GAG GAG GGC CAT 384
Phe Asn Phe Glu Thr Gly Asn Val Leu Phe Leu Thr Glu Glu Gly His
115 120 125

GAC CCC CAG ACG GGC GAG GTG TTC GTC ACC CTC GGC ACG GAG GGG TCT 432
Asp Pro Gln Thr Gly Glu Val Phe Val Thr Leu Gly Thr Glu Gly Ser
130 135 140

GGC CTG CCA ATC GTG CCG CAG GTC TCC AGT ATC CAC GAT ATG CTG TGG 480
Gly Leu Pro Ile Val Pro Gln Val Ser Ser Ile His Asp Met Leu Trp
145 150 155 160

GCG GCG GGT GAG GTC GGG GTG GGC AGT GAG CAG GAG GGT GCC AAG GTC 528
Ala Ala Gly Glu Val Gly Val Gly Ser Glu Gln Glu Gly Ala Lys Val
165 170 175

GAG TTC TCC CCC TCC ATG GCC GGG TTT CTG GAC TGG GGG TTC AGC GCC 576
Glu Phe Ser Pro Ser Met Ala Gly Phe Leu Asp Trp Gly Phe Ser Ala
180 185 190

TAC GCT GCG GCG GGC AAG GTG CTG CCG GCC AGC TCG GCG GTG TCG AAG 624
Tyr Ala Ala Ala Gly Lys Val Leu Pro Ala Ser Ser Ala Val Ser Lys
195 200 205

ACC AGC GGC GTG GAG GTG GAT CGG TAT GTC TCG TTC GTC TGG TTG ACG 672
Thr Ser Gly Val Glu Val Asp Arg Tyr Val Ser Phe Val Trp Leu Thr
210 215 220

GGC GAC CAG TAC GAG CAG GCG GAC GGG TTC CCC ACG GCC CAG CAG GGG 720
Gly Asp Gln Tyr Glu Gln Ala Asp Gly Phe Pro Thr Ala Gln Gln Gly
225 230 235 240

TGG ACG GGG TCG CTG CTG CTG CCG CGC GAG CTG AAG GTG CAG ACG GTG 768
Trp Thr Gly Ser Leu Leu Leu Pro Arg Glu Leu Lys Val Gln Thr Val
245 250 255

GAG AAC GTC GTC GAC AAC GA 788
Glu Asn Val Val Asp Asn
260



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 amino acid residues 

(B) TYPE: Amino acid 

(C) STRANDEDNESS : Not relevant 

(D) TOPOLOGY: Not relevant 

(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Penicillium roqueforti IAM7254 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1 .. 565 

(C) IDENTIFICATION METHOD: E 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Val Asp Phe His Thr Pro Ile Asp Tyr Asn Ser Ala Pro Pro Asn Leu
1 5 10 15

Ser Thr Leu Ala Asn Ala Ser Leu Phe Lys Thr Trp Arg Pro Arg Ala
20 25 30

His Leu Leu Pro Pro Ser Gly Asn Ile Gly Asp Pro Cys Gly His Tyr
35 40 45

Thr Asp Pro Lys Thr Gly Leu Phe His Val Gly Trp Leu Tyr Ser Gly
50 55 60

Ile Ser Gly Ala Thr Thr Asp Asp Leu Val Thr Tyr Lys Asp Leu Asn
65 70 75 80

Pro Asp Gly Ala Pro Ser Ile Val Ala Gly Gly Lys Asn Asp Pro Leu
85 90 95

Ser Val Phe Asp Gly Ser Val Ile Pro Ser Gly Ile Asp Gly Met Pro
100 105 110

Thr Leu Leu Tyr Thr Ser Val Ser Tyr Leu Pro Ile His Trp Ser Ile
115 120 125

Pro Tyr Thr Arg Gly Ser Glu Thr Gln Ser Leu Ala Val Ser Tyr Asp
130 135 140

Gly Gly His Asn Phe Thr Lys Leu Asn Gln Gly Pro Val Ile Pro Thr
145 150 155 160

Pro Pro Phe Ala Leu Asn Val Thr Ala Phe Arg Asp Pro Tyr Val Phe
165 170 175

Gln Ser Pro Ile Leu Asp Lys Ser Val Asn Ser Thr Gln Gly Thr Trp
180 185 190

Tyr Val Ala Ile Ser Gly Gly Val His Gly Val Gly Pro Cys Gln Phe
195 200 205

Leu Tyr Arg Gln Asn Asp Ala Asp Phe Gln Tyr Trp Glu Tyr Leu Gly
210 215 220

Gln Trp Trp Lys Glu Pro Leu Asn Thr Thr Trp Gly Lys Gly Asp Trp
225 230 235 240

Ala Gly Gly Trp Gly Phe Asn Phe Glu Val Gly Asn Val Phe Ser Leu
245 250 255

Asn Ala Glu Gly Tyr Ser Glu Asp Gly Glu Ile Phe Ile Thr Leu Gly
260 265 270

Ala Glu Gly Ser Gly Leu Pro Ile Val Pro Gln Val Ser Ser Ile Arg
275 280 285

Asp Met Leu Trp Val Thr Gly Asn Val Thr Asn Asp Gly Ser Val Thr
290 295 300

Phe Lys Pro Thr Met Ala Gly Val Leu Asp Trp Gly Val Ser Ala Tyr
305 310 315 320

Ala Ala Ala Gly Lys Ile Leu Pro Ala Ser Ser Gln Ala Ser Thr Lys
325 330 335

Ser Gly Ala Pro Asp Arg Phe Ile Ser Tyr Val Trp Leu Thr Gly Asp
340 345 350

Leu Phe Glu Gln Val Lys Gly Phe Pro Thr Ala Gln Gln Asn Trp Thr
355 360 365

Gly Ala Leu Leu Leu Pro Arg Glu Leu Asn Val Arg Thr Ile Ser Asn
370 375 380

Val Val Asp Asn Glu Leu Ser Arg Glu Ser Leu Thr Ser Trp Arg Val
385 390 395 400

405

Ile Ser Arg Glu Thr Tyr Ser Ala Leu Thr Ser Gly Ser Ser Phe Val
420 425 430

Glu Ser Gly Lys Thr Leu Ser Asn Ala Gly Ala Val Pro Phe Asn Thr
435 440 445

Ser Pro Ser Ser Lys Phe Phe Val Leu Thr Ala Asn Ile Ser Phe Pro
450 455 460

Thr Ser Ala Arg Asp Ser Gly Ile Gln Ala Gly Phe Gln Val Leu Ser
465 470 475 480

Ser Ser Leu Glu Ser Thr Thr Ile Tyr Tyr Gln Phe Ser Asn Glu Ser
485 490 495

Ile Ile Val Asp Arg Ser Asn Thr Ser Ala Ala Ala Arg Thr Thr Ala
500 505 510

Gly Ile Leu Ser Asp Asn Glu Ala Gly Arg Leu Arg Leu Phe Asp Val
515 520 525

Leu Arg Asn Gly Lys Glu Gln Val Glu Thr Leu Glu Leu Thr Ile Val
530 535 540

Val Asn Ser Val Leu Glu Val Tyr Ala Asn Gly Arg Phe Ala Leu
545 550 555 560

Gly Thr Trp Ala Arg
565

(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1695 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double stranded 

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Genomic DNA
(vi) ORIGINAL SOURCE:

(A) ORGANISM: Microorganism: Penicillium roqueforti IAM7254

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..1695

(C) IDENTIFICATION METHOD: E
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

GTTGATTTCC ATACCCCGAT TGACTATAAC TCGGCTCCGC CAAACCTTTC TACCCTGGCA 60
AACGCATCTC TTTTCAAGAC ATGGAGACCC AGAGCCCATC TTCTCCCTCC ATCTGGGAAC 120
ATAGGCGACC CGTGCGGGCA CTATACCGAT CCCAAGACTG GTCTCTTCCA CGTGGGTTGG 180
CTTTACAGTG GGATTTCGGG AGCGACAACC GACGATCTCG TTACCTATAA AGACCTCAAT 240
CCCGATGGAG CCCCGTCAAT TGTTGCAGGA GGAAAGAACG ACCCTCTTTC TGTCTTCGAT 300
GGCTCGGTCA TTCCAAGCGG TATAGACGGC ATGCCAACTC TTCTGTATAC CTCTGTATCA 360
TACCTCCCAA TCCACTGGTC CATCCCCTAC ACCCGGGGAA GCGAGACACA ATCCTTGGCC 420
GTTTCCTATG ACGGTGGTCA CAACTTCACC AAGCTCAACC AAGGGCCCGT GATCCCTACG 480
CCTCCGTTTG CTCTCAATGT CACCGCTTTC CGTGACCCCT ACGTTTTCCA AAGCCCAATT 540
CTGGACAAAT CTGTCAATAG TACCCAAGGA ACATGGTATG TCGCCATATC TGGCGGTGTC 600
CACGGTGTCG GACCTTGTCA GTTCCTCTAC CGTCAGAACG ACGCAGATTT TCAATATTGG 660
GAATATCTCG GGCAATGGTG GAAGGAGCCC CTTAATACCA CTTGGGGAAA GGGTGACTGG 720
GCCGGGGGTT GGGGCTTCAA CTTTGAGGTT GGCAACGTCT TTAGTCTGAA TGCAGAGGGG 780
TATAGTGAAG ACGGCGAGAT ATTCATAACC CTCGGTGCTG AGGGTTCGGG ACTTCCCATC 840
GTTCCTCAAG TCTCCTCTAT TCGCGATATG CTGTGGGTGA CCGGCAATGT CACAAATGAC 900
GGCTCTGTCA CTTTCAAGCC AACCATGGCG GGTGTGCTTG ACTGGGGCGT GTCGGCATAT 960
GCTGCTGCAG GCAAGATCTT GCCGGCCAGC TCTCAGGCAT CCACAAAGAG CGGTGCCCCC 1020
GGAGATCTAT TCGAGCAAGT GAAAGGATTC 1080
CTCTTACTGC CGCGAGAGCT GAATGTCCGC 1140
TCGCGTGAGT CCTTGACATC GTGGCGCGTG 1200
GAAACAATGG GAATCTCAAT TTCCAGGGAG 1260
TTTGTCGAGT CTGGTAAAAC GTTGTCGAAT 1320
TCAAGCAAGT TCTTCGTGCT GACAGCAAAT 1380
GGCATCCAGG CTGGTTTCCA GGTTTTATCC 1440
CAATTCTCCA ACGAGTCCAT CATCGTCGAC 1500
ACTGCTGGGA TCCTCAGTGA TAACGAGGCG 1560
AATGGAAAAG AACAGGTTGA AACTTTGGAG 1620
GAAGTATATG CCAATGGACG CTTTGCTCTA 1680
GGCACTTGGG CTCGG 1695



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 574 amino acid residues

(B) TYPE: Amino acid

(C) STRANDEDNESS: Not relevant

(D) TOPOLOGY: Not relevant

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Microorganism: Scopulariopsis brevicaulis IFO4843

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1..574

(C) IDENTIFICATION METHOD: E
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
Gln Pro Thr Ser Leu Ser Ile Asp Asn Ser Thr Tyr Pro Ser Ile Asp
1 5 10 15

Tyr Asn Ser Ala Pro Pro Asn Leu Ser Thr Leu Ala Asn Asn Ser Leu
20 25 30

Phe Glu Thr Trp Arg Pro Arg Ala His Val Leu Pro Pro Gln Asn Gln
35 40 45

Ile Gly Asp Pro Cys Met His Tyr Thr Asp Pro Glu Thr Gly Ile Phe
50 55 60

His Val Gly Trp Leu Tyr Asn Gly Asn Gly Ala Ser Gly Ala Thr Thr
65 70 75 80

Glu Asp Leu Val Thr Tyr Gln Asp Leu Asn Pro Asp Gly Ala Gln Met
85 90 95

Ile Leu Pro Gly Gly Val Asn Asp Pro Ile Ala Val Phe Asp Gly Ala
100 105 110

Val Ile Pro Ser Gly Ile Asp Gly Lys Pro Thr Met Met Tyr Thr Ser
115 120 125

Val Ser Tyr Met Pro Ile Ser Trp Ser Ile Ala Tyr Thr Arg Gly Ser
130 135 140

Glu Thr His Ser Leu Ala Val Ser Ser Asp Gly Gly Lys Asn Phe Thr
145 150 155 160

Lys Leu Val Gln Gly Pro Val Ile Pro Ser Pro Pro Phe Gly Ala Asn
165 170 175

Val Thr Ser Trp Arg Asp Pro Phe Leu Phe Gln Asn Pro Gln Phe Asp
180 185 190

Ser Leu Leu Glu Ser Glu Asn Gly Thr Trp Tyr Thr Val Ile Ser Gly
195 200 205

Gly Ile His Gly Asp Gly Pro Ser Ala Phe Leu Tyr Arg Gln His Asp
210 215 220

Pro Asp Phe Gln Tyr Trp Glu Tyr Leu Gly Pro Trp Trp Asn Glu Glu
225 230 235 240

Gly Asn Ser Thr Trp Gly Ser Gly Asp Trp Ala Gly Arg Trp Gly Tyr
245 250 255

Asn Phe Glu Val Ile Asn Ile Val Gly Leu Asp Asp Asp Gly Tyr Asn
260 265 270

Pro Asp Gly Glu Ile Phe Ala Thr Val Gly Thr Glu Trp Ser Phe Asp
275 280 285

Pro Ile Lys Pro Gln Ala Ser Asp Asn Arg Glu Met Leu Trp Ala Ala
290 295 300

Gly Asn Met Thr Leu Glu Asp Gly Asp Ile Lys Phe Thr Pro Ser Met
305 310 315 320

Ala Gly Tyr Leu Asp Trp Gly Leu Ser Ala Tyr Ala Ala Ala Gly Lys
325 330 335

Glu Leu Pro Ala Ser Ser Lys Pro Ser Gln Lys Ser Gly Ala Pro Asp
340 345 350

Arg Phe Val Ser Tyr Leu Trp Leu Thr Gly Asp Tyr Phe Glu Gly His
355 360 365

Asp Phe Pro Thr Pro Gln Gln Asn Trp Thr Gly Ser Leu Leu Leu Pro
370 375 380

Arg Glu Leu Ser Val Gly Thr Ile Pro Asn Val Val Asp Asn Glu Leu
385 390 395 400

Ala Arg Glu Thr Gly Ser Trp Arg Val Gly Thr Asn Asp Thr Gly Val
405 410 415

Leu Glu Leu Val Thr Leu Lys Gln Glu Ile Ala Arg Glu Thr Leu Ala
420 425 430

Glu Met Thr Ser Gly Asn Ser Phe Thr Glu Ala Ser Arg Asn Val Ser
435 440 445

Ser Pro Gly Ser Thr Ala Phe Gln Gln Ser Leu Asp Ser Lys Phe Phe
450 455 460

Val Leu Thr Ala Ser Leu Ser Phe Pro Ser Ser Ala Arg Asp Ser Asp
465 470 475 480

Leu Lys Ala Gly Phe Glu Ile Leu Ser Ser Glu Phe Glu Ser Thr Thr
485 490 495

Val Tyr Tyr Gln Phe Ser Asn Glu Ser Ile Ile Ile Asp Arg Ser Asn
500 505 510

Ser Ser Ala Ala Ala Leu Thr Thr Asp Gly Ile Asp Thr Arg Asn Glu
515 520 525

Phe Gly Lys Met Arg Leu Phe Asp Val Val Glu Gly Asp Gln Glu Arg
530 535 540

Ile Glu Thr Leu Asp Leu Thr Ile Val Val Asp Asn Ser Ile Val Glu
545 550 555 560

Val His Ala Asn Gly Arg Phe Ala Leu Ser Thr Trp Val Arg
565 570



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double stranded 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microorganism: Scopulariopsis brevicaulis IFO4843

(ix) FEATURE:

(A) NAME/KEY: mat_peptide

(B) LOCATION: 1 .. 1722 

(C) IDENTIFICATION METHOD: E 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CAACCTACGT CTCTGTCAAT CGACAATTCC ACGTATCCTT CTATCGACTA CAACTCCGCC 60
CCTCCAAACC TCTCGACTCT TGCCAACAAC AGCCTCTTCG AGACATGGAG GCCGAGGGCA 120
CACGTCCTTC CGCCCCAGAA CCAGATCGGC GATCCGTGTA TGCACTACAC CGACCCCGAG 180
ACAGGAATCT TCCACGTCGG CTGGCTGTAC AACGGCAATG GCGCTTCCGG CGCCACGACC 240
GAGGATCTCG TCACCTATCA GGATCTCAAC CCCGACGGAG CGCAGATGAT CCTTCCGGGT 300
GGTGTGAATG ACCCCATTGC TGTCTTTGAC GGCGCGGTTA TTCCCAGTGG CATTGATGGG 360
AAACCCACCA TGATGTATAC CTCGGTGTCA TACATGCCCA TCTCCTGGAG CATCGCTTAC 420
ACCAGGGGAA GCGAGACCCA CTCTCTCGCA GTGTCGTCCG ACGGCGGTAA GAACTTCACC 480
AAGCTGGTGC AGGGCCCCGT CATTCCTTCG CCTCCCTTCG GCGCCAACGT GACCAGCTGG 540
CGTGACCCCT TCGTGTTCCA AAACCCCCAG TTCGACTCTC TCCTCGAAAG CGAGAACGGC 600
ACGTGGTACA CCGTTATCTC TGGTGGCATC CACGGTGACG GCCCCTCCGC GTTCCTCTAC 660
CGTCAGCACG ACCCCGACTT CCAGTACTGG GAGTACCTTG GACCGTGGTG GAACGAGGAA 720
GGGAACTCGA CCTGGGGCAG CGGTGACTGG GCTGGCCGGT GGGGCTACAA CTTCGAGGTC 780
ATCAACATTG TCGGTCTTGA CGATGATGGC TACAACCCCG ACGGTGAAAT CTTTGCCACG 840
GTAGGTACCG AATGGTCGTT TGACCCCATC AAACCGCAGG CCTCGGACAA CAGGGAGATG 900
CTCTGGGCCG CGGGCAACAT GACTCTCGAG GACGGCGATA TCAAGTTCAC GCCAAGCATG 960
GCGGGCTACC TCGACTGGGG TCTATCGGCG TATGCCGCCG CTGGCAAGGA GCTGCCCGCT 1020
TCTTCAAAGC CTTCGCAGAA GAGCGGTGCG CCGGACCGGT TCGTGTCGTA CCTGTGGCTC 1080
ACCGGTGACT ACTTCGAGGG CCACGACTTC CCCACCCCGC AGCAGAATTG GACCGGCTCG 1140
CTTTTGCTTC CGCGTGAGCT GAGCGTCGGG ACGATTCCCA ACGTTGTCGA CAACGAGCTT 1200
GCTCGCGAGA CGGGCTCTTG GAGGGTTGGC ACCAACGACA CTGGCGTGCT TGAGCTGGTC 1260
ACTCTGAAGC AGGAGATTGC TCGCGAGACG CTGGCTGAAA TGACCAGCGG CAACTCCTTC 1320
ACCGAGGCGA GCAGGAATGT CAGCTCGCCC GGATCTACCG CCTTCCAGCA GTCCCTGGAT 1380
TCCAAGTTCT TCGTCCTGAC CGCCTCGCTC TCCTTCCCTT CGTCGGCTCG CGACTCCGAC 1440
CTCAAGGCTG GTTTCGAGAT CCTGTCGTCC GAGTTTGAGT CGACCACGGT CTACTACCAG 1500
TTTTCCAACG AGTCCATCAT CATTGACCGG AGCAACTCGA GTGCTGCCGC CTTGACTACC 1560
GATGGAATCG ACACCCGCAA CGAGTTTGGC AAGATGCGCC TGTTTGATGT TGTCGAGGGT 1620
GACCAGGAGC GTATCGAGAC GCTCGATCTC ACTATTGTGG TTGATAACTC GATCGTTGAG 1680
GTTCATGCCA ACGGGCGATT CGCTCTGAGC ACTTGGGTTC GG 1722

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Not relevant

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GCGAATTCCA ATGAAGCTCA CCACTACC 






(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

GCGGATCCCG GTCAATTTCT CTCC

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
GACTGACCGG TGTTCATCC 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CTCGGTTGTC ATAGATGTGG 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CAATCCAGGA GGATCCCAAT GAAG 

(2) INFORMATION FOR SEQ ID NO: 20: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
TGACCGGGAT CCGGGCATGC AG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
CGCGTCGTCT AGAGGTTGTC ACTT 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CCCTATTGGG GTCCATGGCC C 

(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

CAACTGCTGG CATCCTCAGT GA

(2) INFORMATION FOR SEQ ID NO: 24:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs
(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
GCGGATCCAT GAAGCTATCA AATGCAATCA 

(2) INFORMATION FOR SEQ ID NO: 25: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Not relevant

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GCGGATCCTT ACCGAGCCCA AGTGCC 

(2) INFORMATION FOR SEQ ID NO: 26: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Not relevant

(D) TOPOLOGY: Linear

(ii) MOLECULE TYPE: Synthetic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26:

GCGGATCCAA TGAAGCTCAC CACTACC



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:
GCGGATCCCG GTCAATTTCT CTCC 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTCACCGCCT GGCGCGATCC G 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

GGCACGGAGT GGTCTGGCC 



(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 24 base pairs

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30:
CTCCAGTATC AAGGATATGC TGTG 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31:

CGACCAGTAC AAGCAGGCGG 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32:
TCCAGTATCC GCGATATGCT G 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

CGGCACGGAG GTTTCTGGCC TGC

(2) INFORMATION FOR SEQ ID NO: 34:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23 base pairs

(B) TYPE: Nucleic acid

(C) STRANDEDNESS: Not relevant
(D) TOPOLOGY: Linear
(ii) MOLECULE TYPE: Synthetic DNA
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34:
CGGCACGGAG GAGTCTGGCC TGC 

(2) INFORMATION FOR SEQ ID NO: 35: 
(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 23 base pairs

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Not relevant 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Synthetic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:
CGGCACGGAG GATTCTGGCC TGC